A statistical model for word discovery in child directed speech
نویسنده
چکیده
A statistical model for segmentation and word discovery in child directed speech is presented. An incremental unsupervised learning algorithm to infer word boundaries based on this model is described and results of empirical tests showing that the algorithm is competitive with other models that have been used for similar tasks are also presented.
منابع مشابه
Statistical Speech Segmentation and Word Learning in Parallel: Scaffolding from Child-Directed Speech
In order to acquire their native languages, children must learn richly structured systems with regularities at multiple levels. While structure at different levels could be learned serially, e.g., speech segmentation coming before word-object mapping, redundancies across levels make parallel learning more efficient. For instance, a series of syllables is likely to be a word not only because of ...
متن کاملA Statistical Model for Word Discovery in Transcribed Speech
English speech lacks the acoustic analog of blank spaces that people are accustomed to seeing between words in written text. Discovering words in continuous spoken speech then is an interesting problem that has been treated at length in the literature. The issue is particularly prominent in the parsing of written text in languages that do not explicitly include spaces between words, and in the ...
متن کاملMAP Lexicon is Useful for Segmentation and Word Discovery in Child Directed Speech
An efficient algorithm for segmenting child-directed speech into words has recently been proposed in the Machine Learning journal. This short technical note proposes some modifications to this algorithm. In particular, a slightly more conservative variation of the original approach is proposed that infers word boundaries based simply on the maximum a-posteriori lexicon. Results of empirical tes...
متن کاملPhonetic variation in consonants in infant-directed and adult-directed speech: the case of regressive place assimilation in word-final alveolar stops.
Pronunciation variation is under-studied in infant-directed speech, particularly for consonants. Regressive place assimilation involves a word-final alveolar stop taking the place of articulation of a following word-initial consonant. We investigated pronunciation variation in word-final alveolar stop consonants in storybooks read by forty-eight mothers in adult-directed or infant-directed styl...
متن کاملThe Neglected Universals: Learnability Constraints and Discourse Cues
words: 60 Text words: 967 References words: 450 Total words: 1477 The Neglected Universals: Learnability Constraints and Discourse Cues Heidi Waterfall Dept. of Psychology, Cornell University Ithaca, NY 14853, USA and Dept. of Psychology, University of Chicago Chicago, IL 60637, USA [email protected] Shimon Edelman Dept. of Psychology, Cornell University Ithaca, NY 14853, USA and Dept. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره cs.CL/9910011 شماره
صفحات -
تاریخ انتشار 1999